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Abstract: 

This paper introduces a new semi-parametric approach to the pricing and risk management of 
bespoke CDO tranches, with a particular attention to bespokes that need to be mapped onto 
more than one reference portfolio. The only user input in our framework is a multi-factor model 
(a "prior" model hereafter) for index portfolios, such as CDX.NA.IG or iTraxx Europe, that 
are chosen as benchmark securities for the pricing of a given bespoke CDO. Parameters of the 
prior model are fixed, and not tuned to match prices of benchmark index tranches. Instead, 
our calibration procedure amounts to a proper reweightening of the prior measure using the 
Minimum Cross Entropy method. As the latter problem reduces to convex optimization in a 
low dimensional space, our model is computationally efficient. Both the static (one-period) 
and dynamic versions of the model are presented. The latter can be used for pricing and 
risk management of more exotic instruments referencing bespoke portfolios, such as forward- 
starting tranches or tranche options, and for calculation of credit valuation adjustment (CVA) 
for bespoke tranches. 
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1 Introduction 



It is well known that virtually all models for pricing and risk management of derivative securities 
are interpolators of different degrees of sophistication. Given observed market prices of various 
derivatives, the problem of pricing a new, illiquid security, whose price is not available from 
the marketplace, amounts to first finding benchmark traded (liquid) derivatives most similar 
to the one at hand, and then providing a rule by which their prices should be interpolated (and 
possibly extrapolated) in order to price our security. 

The second step is where one needs a model. Complexity of the model depends on the 
instrument one needs to price, and on the market. In some cases, the model can be as simple as a 
one-dimensional spline (or even linear) interpolation. This is the case when e.g. an equity option 
with a given strike and maturity is priced by interpolating implied volatilities of benchmark 
traded options on the same underlying and with the same maturity. Note that generally such 
interpolation is done not directly in the price space, but in some "model parameter space", 
a procedure that should properly handle constraints imposed by the no-arbitrage principle, 
and also account for stochasticity of underlying price processes. In particular, as long as the 
underlying for both the illiquid and liquid options is the same, pricing by interpolation is 
relatively straightforward once one specifies a stochastic model for the underlying, calibrates 
it, and identifies relevant parameters that differentiate between the benchmark and the target 
derivatives. 

In the case of correlation-dependent portfolio credit derivatives, the role of benchmark 
securities is played by standardized tranches referencing standard portfolios, called credit index 
portfolios, such as CDX.NA.IG or iTraxx Europe. Both these portfolios consist of synthetic 
exposures (in the credit default swap format) to 125 liquidly traded investment grade corporate 
obligors ("names"), added with equal weights. Index tranches are swap contracts covering 
portfolio losses between an attachment and detachment points (commonly referred to as strikes) 
that are expressed as a percentage of the total portfolio notional. Standardized strikes are 0, 
3, 7, 10, 15, 30, 100 % for CDX.NA.IG index, and 0, 3, 6, 9, 12, 22, 100 % for iTraxx Europe 
index, and standard tradeable maturities are 5, 7, 10 and (to a lesser extend) 3 years. Other 
standard portfolios are somewhat less liquid, but still are more or less actively traded in the 
market. In particular, CDX.NA.HY index has 100 US non-investment grade ("high yield") 
names, with standardized strikes of 0, 10, 15, 25, 35 and 100 %. 

Market participants use quoted prices of standardized index tranches in order to estimate 
prices of other correlation-dependent derivatives, such as cash CDO tranches, non-standard 
tranches referencing standard index portfolios with strikes or/and maturities different from 
those actively traded, or customized (bespoke) synthetic tranches whose reference portfolios 
differ in their composition from credit index portfolios. More specifically, as prices of these 
instruments are driven by both observable (CDS prices) and unobservable parameters (in par- 
ticular, parameters determining default dependencies under the risk-neutral measure, such as 
asset correlations in the Gaussian copula model), practitioners typically use market prices to 
estimate the latter set of parameters using a specific model, and then use (and possibly interpo- 
late/extrapolate) these parameters within the same model to price the instrument in question. 

In this paper, we address the problem of pricing sythnetic bespoke CDO tranches by unrav- 
eling of information contained in the market prices of benchmark index tranches. In the Street 
jargon, this is often referred to as "bespoke mapping problem", implying that a given bespoke 
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tranche is "mapped" onto index tranches. Obviously, this problem can be viewed as another 
example of pricing by interpolation. On the other hand, it is clear that, as long as composition 
of a bespoke portfolio is different from that of an index portfolio, the bespoke mapping problem 
is considerably more complicated than the above problem of pricing an illiquid equity option 
by interpolation of Black-Scholes implied volatilities. The reason is that the present case re- 
quires not only interpolation across strikes and/or maturities, but should also somehow involve 
interpolation/extrapolation across the underlying. 

As the Gaussian copula with base correlations (and a possible extension to random recov- 
ery) continues, in spite of its well known drawbacks, to serve as the current market standard, 
practitioners usually pose the bespoke mapping problem as the base correlation mapping prob- 
lem. The idea here is to find proper correlation parameters for a bespoke tranche at hand by 
a suitable adjustment of market-implied base correlations for standard index tranches, with an 
adjustment designed to account for differences between the index and bespoke portfolios. 

While various base correlation mapping rules are used by practitioners, these rules are ad 
hoc and lack a theoretical or empirical justification. Worse yet, the base correlation method is 
theoretically inconsistent, and occasionally violates no-arbitrate constraints in practice, even 
when applied to a simpler problem of pricing non-standard tranches referencing the same index 
portfolio. Furthermore, being a one-factor model, the base correlation approach does not prop- 
erly addresses the sector concentration risk, and thus cannot be expected to provide thustworthy 
results as long as composition of a bespoke portfolio is materially different from that of the ref- 
erence index portfolio. More details on the base correlation mapping rules and their drawbacks 
will be given below. 

In this paper we develop a consistent and practically oriented model for pricing despoke 
CDOs and other portfolio credit derivatives. We specifically concentrate on the case of bespokes 
that have to be mapped onto more than one reference index, which is often the case in practical 
settings, where a given bespoke portfolio can include e.g. both US and European names, or both 
investment and non-investment grade (IG and HY, respectively) names. Comparing to bespokes 
that need only be mapped onto one reference index, the latter case is more complex and produces 
higher modeling uncertainty. At the same time, it is also more computationally demanding, as 
it calls for a multi-factor framework (see below) in which one has to simultaneously calibrate 
to tranches written on all reference indices. 

Our framework combines several modeling concepts, and generally belongs in the class of 
"implied distribution" models that have gained in popularity in recent years. We start with a 
bottom-up view of an index portfolio within a factor framework with a multivariate "market 
factor" Z, where individual defaults are independent conditional on the value of Z. Any 
arbitrage-free factor model of conditional independence type can be used here, examples are 
discussed below. This model is referred to as the prior model. At the next step, we departure 
from the usual bottom-up framework in two aspects. First, we assume that parameters in our 
prior model are fixed once and for all, i.e. we do not fit this model to available data. Second, 
we give up the fine resolution of the portfolio loss into contributions of individual obligors, and 
instead bucket all names in the index portfolio (and their respective losses) into two groups of 
names (sub-portfolios) according to their membership degree in the bespoke portfolio^] In this 
way we construct a dichotomic representation of index portfolio losses as a joint loss distribution 

1 I.e. the first group includes all names from the index portfolio that enter the bespoke portfolio, and the 
second group includes all the rest. 
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of two sub-portfolios. As an avatar of the factor framework we have started with, losses in the 
sub-portfolios are independent conditional on the market factor. This representation serves as 
a single object of further analysis. 

In the second step, we calibrate our model by finding a minimal functional distortion of the 
prior dichotomic loss distributions of index portfolios needed in order to match observed prices 
of tranches referencing these portfolios. This is done within a semi-parametric framework based 
on the Minimum Cross Entropy (MCE) method, where the number of free parameters is data- 
driven, and is equal to the number of tradable tranches. The resulting joint loss distribution is 
then used to price tranches on a bespoke portfolio. Details of this procedure will be introduced 
in Sect. 3. 

The justification for taking a top-down view of the portfolio loss process is three-fold. 
First, market incompleteness imposes severe restrictions on the extent to which the "true" 
(market-implied) risk-neutral measure can be learned. For example, for a standard portfolio 
like CDX.NA.IG or iTraxx Europe, the only available source of information about risk-neutral 
dependencies are tranche prices (6 quotes per maturity), which is not sufficient to infer the pric- 
ing measure in a unique way. The only way out is to model it using a low dimensional space 
of adjustable parameters. With our MCE approach, such a parametrization is constructed 
directly in the loss space, no-arbitrage relations are satisfied by construction, and calibration 
amounts to convex optimization)^} Second, as will be discussed in more details in Sect 2.4, the 
knowledge of dichotomic loss distributions of index portfolios is sufficient to price tranches on 
bespoke portfolios made of arbitrary compositions of " chunks" of index portfolios, and can be 
readily generalized to other types of bespoke portfolios which involve names not belonging to 
any index. Third, our approach is computationally more efficient than a bottom-up one which 
can become quite computationally intense in a typical setting of pricing bespoke tranches that 
need be mapped onto more than one index portfolio. 

The rest of this paper is organized as follows. The reminder of this introduction discusses 
base correlation mapping methods and explains the relation of our approach to the previous 
literature. In Sect. 2 we provide a high-level qualitative overview of key elements of our modeling 
framework. Sect. 3 provides technical details of the Minimum Cross Entropy (MCE) calibration 
scheme. Information-theoretic aspects of our problem are discussed in the appendix. In Sect. 4 
we present a dynamic version of the model. Sect. 5 deals with generalization and extensions. 
Numerical examples are considered in Sect. 6. The final Sect. 7 concludes. 

1.1 Base correlation mapping methods 

A one-factor Gaussian copula model developed by Li [TS] continues to be the market standard 
model. As in its original form the model is uncapable of matching market prices, practitioners 
use it in a way similar to the way the Black-Scholes model is used in other markets. Namely, 
traders convert market prices of index tranches into what is called base correlations. The 
latter are defined in two steps. First, one converts market prices into prices of synthetic equity 
tranches (base tranches) covering losses from % to K%, where K is one of the standard strikes 
for the index under consideration. Second, base correlation j3(K) for strike K is found as the 
correlation parameter that should be used by the Gaussian copula model in order to match the 

2 Note that calibration of most of bottom-up models amounts to non-convex optimization with multiple local 
minima, which in practice often leads to unstable calibration and hedging. 
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price of base tranche with strike K. Market prices typically imply an increasing "correlation 
skew" function (3{K). 

Within the Gaussian copula/base correlation framework, the problem of pricing bespoke 
tranches amounts to calculating base correlations for a bespoke portfolio from base correlations 
for an index portfolio. This is achieved by first calculating strike K t of the index portfolio that 
is "equivalent" (in a sense defined below) to strike K b of the bespoke tranche, and then using 
interpolation of base correlations for standard strikes in order to find (3{Ki). Several methods 
are used by traders to define what is meant by such "equivalence" : 

• Absolute strike rule (no mapping): Here to price a bespoke tranche with strike Kb, we 
use the base correlation for the same strike, i.e. our rule is Ki = K b . Such a naive rule is 
seldom used as it stands, as the probabilities to reach the same strike for a bespoke and 
index portfolio can be very different if e.g. their average spreads are vastly different. 

• ATM mapping rule: Here one assumes that the observed correlation skew /3(t, K, L, . . .) 
(written here as a general function of calendar time t, strike K, expected portfolio loss L 
and possible other variables) is a function of t and a single dimensionless ratio ( "money- 
ness") x = K/L: 

/3(t,K,L,...)=p(t,x=*) (1) 

(Note that if the skew is a function of only t, K and L, then the ansatz Q follows simply 
on the dimensional grounds.) 

Given expected losses L b and Lj of the bespoke and index portfolios, respectively, the two 
portfolio will have the same (3, and hence the same risk, when they have the same value 
of x. This yields the following relation for the index strike Ki that is "equivalent" to a 
given strike Kb of the bespoke portfolio: 

Li{ Lib 

It is interesting to note that ansatz ([T| has some dynamic implications for the behavior 
of the correlation skew. Indeed, let us calculate its partial derivatives: 

dl \_dJS_ 

dK Ldx U 
d£ K 8(3 

dL ~ ~L^dx~~~TdK 

If the skew is upward sloping, then the second of Eqs.([3]) implies that d(5/dL < 0, i.e. the 
skew moves downward as the expected portfolio loss (and hence the par portfolio spread) 
increases. 

If we assume that the correlation skew (3(t,x) is a slowly changing function of its first 
argument^ i.e. its changes are largely driven by changes of x, then the latter prediction 



3 Equivalently, we assume that the characteristic scale on which (3(t, x) changes as a function of its first 
argument is of order of the length of an economic cycle, i.e. a few years. Note that this is the same assumption 
that is tacitly made anyway in using single-period models such as CreditMetrics/ Gaussian copula model for 
pricing CDO tranches. 
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Figure 1: Correlation skew for CDX.NA.IG11 as a function of moneyness. Expected portfolio 
loss is 9.72 % on 04/15/09 and 9.16 % on 05/15/09. 



can be tested using the actual data. In Fig. [T] we show the profile of the correlation skew 
of the CDX.NA.IG11 index portfolio for 04/15/09 and 05/15/09. One sees that while the 
index spread goes down, the skew moves down as well. We conclude that the ATM rule 
(|2| is likely to be in conflict with empirical data. 

What is missing in Eq.(|2])? Note that the strike adjustment according to ^ is determined 
only by the portfolio expected loss and not e.g. higher moments of the loss distribution. 
This may appear counter-intuitive. Indeed, comparing two portfolios having e.g. the 
same average spread but different spread dispersion, we expect that the more dispersed 
portfolio should have higher risk under nearly any reasonable risk metric]^] 

• Probability matching rule: Ki is determined from the condition that cumulative prob- 
ability of losses reaching strikes Kb and Ki for the bespoke and index portfolios should 
match: 

Pr{L i <K i ) = Pr{L b <K h ) (4) 

This prescription takes higher moments of the loss distributions for the bespoke and index 
portfolio into account. However, the problem now is that the right-hand side (RHS) of 
(Q is only defined as long as the loss distribution for the bespoke is known, but the latter 
is exactly what we want to find in the first place! This means that on its own, Eq.Q is 
incomplete, and thus potentially has an infinite number of solutions. The way the rule Q 
is used in practice is to assume that its RHS can be computed using CreditMetrics/Base 
Correlation model with correlation /3(fQ). When interpreted in this way, Eq.(|4]) becomes 
a highly non-linear equation for Ki, whose fixed point determines both the equivalent 
strike and the pricing measure for the bespoke. However, it is important to realize that 
this assumption is ad-hoc, and may lead to a sub-optimal and/or biased solution of the 

In terms of the dimension analysis , we can expect that, in addition to x = K/(L), correlation can depend 
on K/^J (L 2 ) — (L) 2 and other dimensionless combinations. 
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bespoke mapping problem^} As a practical problem, the probability matching method 
does not always have a solution for bespokes with wide spreads. 

While other similar recipes of base correlation mapping are available in the literature, see 
e.g. [24] , the main problem with all these methods is that they are ad hoc and do not have a solid 
theoretical or empirical basis. In practice, their use leads at times to violation of no-arbitrage 
relations for bespoke portfolios across strikes or maturities. 

Further problems with the use of base correlation mapping rules have to do with market 
incompleteness. Market prices of standard tranches provide only information on average risk- 
neutral default dependencies for a well-diversified (across different industrial sectors) index 
portfolio, and not information about default correlations specific to different industrial sectors. 
This implies that the base correlation method within a one-factor Gaussian copula framework 
does not capture sector concentration risk, and thus can be potentially dangerous to use for 
bespoke portfolios whose sector or/and geographical composition is materially different from 
that of standard portfolios. For a further discussion of practical difficulties arising with the use 
of base correlation approach for pricing bespoke tranches, see e.g. [21], [15] and [20] . 

1.2 Relation to previous literature 

Our model combines elements of both bottom-up and top-down approaches. Here we would 
like to give a brief account of literature most relevant to our approach. 

In a typical bottom-up model, we start at the view of a credit portfolio as a collection 
of separate exposures ("single names"), whose default dependencies are then introduced via 
a factor copula in structural models, or via a multi-variate intensity process. Any consistent 
multi-factor bottom-up model, with parameters tuned to roughly match the observed data, can 
be used as the "prior" model in our approach. Examples inlcude, in particular, the Gaussian 
copula model of Li [18], a multi- factor version of the RFL model [I], or reduced-form models, 
see e.g. Duffie and Garleanu [7J, or Chapovsky et al [5]. 

Once the prior model is chosen, we switch to a top-down paradigm, concentrating on the 
dichotomic representation of portfolio losses (see above), and abstracting from the initial single- 
name picture. The idea of modeling credit portfolios in a top-down manner is originally due to 
Giesecke and Goldberg [8], Sidenius, Peterbarg and Andersen [22], and Schonbucher [21]. Our 
approach resembles the BSLP model of Arnsdorf and Halperin [2] in the sense that transition 
from the initial ("prior") distribution to a "true" (calibrated) one is achived via a set of mul- 
tiplicative loss-dependent factors applied directly to the portfolio loss distribution, though a 
particular realisation of this idea in the present framework is different. 

Effectively, our procedure amounts to constructing loss distributions for both the index and 
bespoke portfolio in a way implied by observed prices of standard tranches. Thus our method 
belongs in the class of implied loss distribution approaches which have become quite popular 
among practitioners in recent years. Examples of such approach include e.g. the implied copula 
model of Hull and White [15], [16], or the factor model of Inglis and Lipton [17]. Unlike these 
authors, we work in a multi-factor setting that starts with a usual factor model, and consistently 

5 More specifically, we assert that the bespoke pricing measure obtained with this prescription is sub-optimal 
according to the information-theoretic MCE method which allows to chooce a "best" bespoke measure among 
possible solutions of (E|, see Sect. 3. 
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apply information-theoretic methods in order to construct implied loss distribution, while e.g. 
Hull and White adopt smoothness criteria as the main tool in their method. 

In employing a multi-factor framework and using information-theoretic (entropy-based) 
methods for calibration, our approach is akin to that taken by Rosen and Saunders [20]. They 
develop an entropy-based calibration method for a class of multi-factor bottom-up models, 
where adjustment to market prices amounts to calculation of implied market factor distribu- 
tion. Unlike the latter authors, we use a top-down entropy calibration where we calibrate to 
losses in tranches and suitably chosen sub-portfolios (see below), but not to individual names. 
This choice is made for the sake of simplicity and efficiency of implementation, and to facilitate 
an easy transition to a dynamic setting. On the implementation side, Rosen and Saunders 
employ a Monte Carlo scheme which is a preferred method when the number of market factors 
is three or more, while in our model we settle for a practically-oriented two-factor framework)^ 
and design an efficient lattice-based scheme for calibration and pricing, which is comparable in 
performance to the BSLP model of Ref. [2] . The main difference of our present framework from 
more traditional top-down approaches is that for pricing of bespoke tranches, we need a finer 
resolution of portfolio loss scanarios, which is achieved via calculation of joint loss distributions 
of sub-portfolios of credit indices. In concentrating on the dynamics of losses in sub-portfolios 
of index portfolios, our approach is similar in spirit to a multi-portfolio top-down model of Zhou 
[27] (see also [12]). In order to calculate loss distributions of sub-portfolios, Zhou relies on the 
random thinning technique similar to that of Giesecke and Goldberg [8], and Halperin and 
Tomecek [13] . However, his model does not employ a factor framework, which in our opinion is 
very useful, both conceptually and computationally, for modeling bespoke portfolios that have 
to be mapped onto more than one index portfolio. The approach taken in the present paper 
(both the parametrization and calibration method) is different from those used in [2], [27] and 

P-fl- 

2 Implied Multi-Factor Model at a glance 

In this section we provide a high level, qualitative overview of our approach. All technical 
details are left for the next section. Here our task is to introduce different key components 
of our framework and explain how they help to address various deficiencies of more tradi- 
tional approaches to credit portfolio modeling in general, and the bespoke mapping problem in 
particular. 

2.1 No-arbitrage 

No-arbitrage conditions ensure that losses can only increase over time, and are clearly among 
the most important requirements for a consistent bespoke model. The way no-arbitrage is 
enforced depends on the model. For example, in a dynamic model it can be imposed on the 
loss process, while for single-period models, it is usually formulated as conditions on expectations 
of future losses as functions of the time horizon and loss level. In particular, expected loss for a 
tranche or a portfolio should increase with the time horizon, ensuring no-arbitrage across time. 

6 As discussed by Rosen and Saunders [20] . a practical difficulty with using more market factors is that 
market prices convey virtually no information on distributions of sector-specific factors. 



7 



No-arbitrage across loss levels is enforced as long as the expected loss for first loss (equity) 
tranche is an increasing and concave function of the strike. This ensures that the portfolio loss 
distribution is non-negative at any loss level. Recall here that the base correlation methodology, 
the current market standard, leads to occasional violations of no-arbitrage in both the strike 
and time dimension. Such failure to ensure no-arbitrage can be traced back to the fact that in 
the base correlation approach, bespoke tranches are priced by interpolation/extrapolation in 
the "wrong" space (the correlation space), where conditions of no-arbitrage are hard to check or 
enforce. On the contrary, in our model no-arbitrage across strike is ensured by the fact that the 
bespoke price is calculated directly in the loss space using probabilistic arguments without any 
need of interpolation in an auxiliary correlation space, with the loss density being non-negative 
by construction as long as our "prior" model Q is arbitrage-free. This follows simply from 
the fact that in our model the "true" measure P is found by minimization of a information- 
theoretic "distance" (KL-divergence, see below in Sect. 3) between P and Q, which is finite as 
long as two measures P and Q are equivalent. Therefore, as long as the prior distribution Q is 
arbitrage-free, the "true" distribution P obtained using minimization of the KL-divergence is 
guaranteed to be arbitrage-free as well. 

The question of arbitrage across time is somewhat more subtle. Our model comes in two ver- 
sions: one-period and multi-period. In the former, no-arbitrage is guaranteed by construction 
across strikes, but not necessarily across time (though the latter is found to hold a posteriori in 
our numerical experiments). The dynamic version is free of arbitrage across both strikes and 
time by construction. No-arbitrage across time is ensured as long as we pick an arbitrage-free 
prior model for the portfolio loss process, see below. 



2.2 Multi-factor structure 

Most of the models currently in use in the industry are one-factor models, where default de- 
pendence between names in a portfolio arises due to dependence of individual defaults on a 
one-component "market" factor Z that is common for all names in the portfolio. Such ap- 
proach, expressing default dependencies as a result of dependence on a single common factor, 
may be reasonable for a well-diversified portfolio such as CDX.IG.NA. 

However, it is by no means clear that the same one-factor framework can be applied to 
bespokes whose composition is materially different from that of an index portfolio, like our 12- 
type bespokes introduced above. It appears that for such bespokes, a more realistic framework 
that takes into account diversification effects (i.e. qualitative differences in portfolio composi- 
tion), should include several market factors. As a minimum requirement, we should have two 
(dependent) factors Z\ and Z 2 , that could be thought of as a "US factor" and "Europe factor", 
or a "geographic factor" and "industry factor", see Fig. |2j A two-factor framework thus seems 
to provide a minimal complexity level for a bespoke model needed for all but simplest bespoke 
portfolios that only need to be mapped onto one credit index. Given that market-implied 
prices of standard index tranches contain virtually no information on factors corresponding to 
separate sectors beyound a single factor corresponding to a diversified credit index as a whold^J 
we believe that a two-factor specification may simultaneously serve as the upper "complexity 
bound" for a bespoke model to be useful and manageable in practice^} We therefore stick to a 

7 See Rosen and Saunders [2U] for a discussion on this point. 

8 More factors might be needed for more complex bespokes referencing more than two credit indices, e.g. for 
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Figure 2: Layout of the Implied Multi-Factor Model (IMFM). 



two-factor formulation in what follows, and leave generalization for Sect. 5. 

2.3 Types of bespokes that can be priced with IMFM 

As mentioned above, we want to be able to price arbitrary bespoke tranches which, by nature 
of their composition, should be mapped onto more than one index portfolio. We achieve this by 
constructing a certain hierarchy of "bespoke complexity". We first address a simplest bespoke 
portfolio: a bespoke that is made of "chunks" of index portfolios. Assume we have two index 
portfolios Hi and n 2 corresponding e.g. to the CDX.NA.IG and iTraxx Europe Main credit 
portfolios. Our simplest bespoke portfolio is made of names making up certain sub-portfolios 
(hereafter "relevant" sub-portfolios) ITn and H21 of index portfolios, respectively, while all 
names from complement sets II12 = and H22 = n^IT^i will be collectively referred to 

as "complement" sub-portfolios (see Fig. |2|. We will refer to such bespokes made of names 
from two, three etc. indices as 12, 13, . . . portfolios. One example of a 12 portfolio would be 
a bespoke portfolio with 100 names with 50 US names from the CDX.NA.IG index portfolio, 
and 50 European names from iTraxx Europe index portfolio. 

Analysis in the following Sects. 3 and 4 will concentrate on the pricing of 72-type bespoke 
portfolios. This setting forms our basic case which will be worked out in great details. Treat- 
ment of more complex bespoke portfolios (e.g. portfolios made of chunks of three and more 
indices, or portfolios containing names not belonging to any index) will be presented as suitable 
generalizations of our basic setting in Sect. 5. 

/3-type bespokes we would need three factors, etc. 
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2.4 Other applications: credit exotics and CVA on bespoke tranches 

In addition to the ability to price bespoke CDO tranches on portfolios mapped onto more 
than one credit index, our model has, for its dynamic version, other interesting applications as 
well. First, more exotic portfolio derivatives, such as e.g. tranche options or forward-starting 
tranches, can be priced using a loss lattice with a backward recursion method, similar to how it 
is done in the BSLP model [2]. Second, the dynamic version of our multi-factor framework can 
be used for counterparty risk management of CDO tranches, in particular, for calculation of 
credit valuation adjustments (CVAs) on bespoke tranches. Again, a lattice formulation enables 
a quick and efficient implementation of the computational algorithm for this case. 

2.5 Computational efficiency: a top-down approach 

A multi-factor credit portfolio model of a traditional buttom-up type can run into numerical 
challenges rather quickly. Indeed, in consistent models for tranche prices such as RFL (which is 
a single-factor model), calibration takes around 5-30 min, depending on the model, efficiency of 
implementation and the particular calibration dataset. When moving to a two-factor framework 
with a double number of names, calibration time is expected to grow by at least an order of 
magnitude^] , which might become computationally infeasible on a single PC. 

We choose to address these potential issues by adopting a version of the top-down approach 
to credit portfolios modeling. To introduce our setting, let us return to the case of a simplest 
/2-portfolio of the previous section. Recall that this is a bespoke composed of two "chunks" 
of two index portfolios (out of the total of four "chunks"). Let Xu and X 2 i be stochastic 
losses for "relevant" sub-portfolios LTn and n 2 i, respectively, with X\ 2 and X22 being losses 
in complement sub-portfolios II12 = Ili\IIii and H22 = Ii^n^i. Then we have the following 
obvious relations for the total losses X\ and X 2 for portfolios Hi and n 2 : 

Xi = X n + X u , X 2 = X 2 i + X 2 2 (5) 

The top-down specification of our approach amounts to the fact we only keep track of cumulative 
sub-portfolio losses X\ = (Xn,Xi 2 ) and X 2 = (X2i,X 22 ) (we use hereafter a compact vector 
notation) , but not default states of individual names in either the "relative" or "complement" 
sub-portfolios. As long as the composition of our bespoke is as described abovd^J the joint 
distribution of sub-portfolio losses An and X 2 \ is all we need to price the bespoke. 

The latter is easy to demonstrate. Consider a given time horizon T, and assume for the mo- 
ment that we have somehow managed to calibrate this market factor distribution together with 

conditional joint loss distributions of sub-portfolios of the index portfolios P\ \X\ii Xi 2 \z\ and 

9 Let us assume for simplicity that the number of steps of an optimization algorithm (e.g. a gradient-based 
one) needed to calibrate the model is proportional to N tra nche - the number of tranches in the calibration set. 
Then the total time needed to calibrate the model scales as T ~ N% ames N® arket N tr anche, where the first factor 
is due to ~ ^names operations needed to perform convolutions to calculate conditional loss distributions, while 
the second factor is due to fact that this calculation should be repeated NP^ ket time, where N mar k e t is the 
number of discretization points for one component of the factor, and D is the number of component. If we 
use e.g. a 10-point Gaussian quadrature for integration over the market factor (i.e. N mar k e t = 10), when we 
proceed from one factor setting with N names = 125 (i.e., one index), N tr anche = 6, D = 1 to a two-factor setting 
and two index portfolios, the computational time increases by a factor 4-ZV mar / cet = 40. 
^Generalizations will be introduced below. 
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Pi {x2\,X2i\Z^ for this horizon. We can now easily price any tranche referencing the bespoke 

portfolio I1 B = n n Uil 2 i. Indeed, as losses in two portfolios are conditionally independent, the 
conditional loss distribution is easily found by convolution 

P B (X n + X 21 = x\Z) = j dyP 1 (y\z) P 2 (x - y\z) (6) 

where conditional loss distributions Pi (Xn |i) and P 2 (x 2 i\Z} are obtained by marginaliza- 

tion of Pi ^X\\, X\2\Z^ and P 2 (^X 2 i, X 22 \Z^j over X\ 2 and X 22 , respectively. Assuming that 

the common market factor is a continuous random variable with pdf Pz(z), the unconditional 
loss distribution is then obtained as follows: 

p B (x,T) = J P B {x\z)P z {z)dz (7) 

Once the loss distribution for the bespoke portfolio is calculated on a grid {Tj} of time horizons, 
any bespoke tranche is priced by standard formulae using numerical integration, see below in 
Sect. 3. 3. 



3 IMFM in a single-period setting 

In this section we present the Implied Multi-Factor Bespoke Model (IMFM) in a simplified 
single-period setting, where we only deal with the losses of index portfolios and their sub- 
portfolios at a single time horizon T. Once worked out, this will serve as a seed for a dynamic 
multi-period extension which is introduced in the next section. 

3.1 A two-factor CreditMetrics prior 

As was mentioned above, while our formulation is straightforward to generalize to an arbitrary 
number of market factors, our basic formulation will deal with a specific case of two market 
factors. In component notation, the latent variable for the i-th obligor in the fc-th index portfolio 
takes the following form 

4 k) = + PgZt + yjl-($) 2 -($) 2 -Zpfflfflei (8) 

Here Z = (Zi,Z 2 ) is a two-dimensional Gaussian random variable with means 0, variances of 
one, and correlation p. 

While parameters of such model can be estimated from time series of spreads or defaults, we 
leave this task for future work, and instead adopt in this paper a simple parametrization that 
ensures consistency with a one-factor Gaussian copula framework. Assuming that the latter 
model is already estimated, our parametrization guarantees that no further work for parameter 
estimation is required. 

We concentrate on a basic setting of two index portfolios, 111 and Il 2 . We assume that 

4 1} = ^ii > P$? = (9) 
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In words, we assume that for each name, its factor loading to the "foreign" factor is a fixed 
proportion of its factor loading to its "domestic" factor. It is easy to see that, as long as we 
view portfolio 111 ( II2) in isolation from portfolio H2 (resp. Ill), the resulting model is identical 
to a one-factor model with factor loadings &• (or bf^) provided we set 



#1 



(i) 



b^ 



a/1 + lap + a 2 

l(2) 

/4 2) = . 1 (10) 

a/1 + lap + a 2 

Note that this framework slightly generalizes a more conventional hierarchical structure which 
is recovered with the present formalism in the limit a = 0. 

Our parametrization ^ ensures that pairwise asset correlations for names in the same 
portfolio are the same as in the one-factor model irrespective of values of p and a, e.g. for 
portfolio IIi we obtain 

P^=^^(l + 2ap + a>) = b^ (11) 
On the other hand, for two names in different portfolios we find 

pg" = wi + « 2 ) + *.) = ^f ( ;;°?^ a 2 ° a ^ < i2) 

Note that equality in the last expression follows in the limit a = corresponding to a hierarchi- 
cal model. Therefore, for a fixed p, a non-zero value a > enhances inter-sector correlations 



11 



In the implementation of the model, we discretize the market factor as follows. Using vector 
notation Z = (Z lt Z 2 ), the range of possible values of Z is discretized to a 2D grid 

where mi = 1, ... , N m \ and m 2 = 1, . . . , N m2 , and 

m = (mi, 777.2) , mi,m2 £ Z + (14) 

is a 2D vector of integers. Corresponding discretized probabilities of realizations of these values 
of the market factor on the grid {z mi , z m2 ] read 

hrh = h mit m 2 = P \Z\ = Z mi , Z 2 = Z m2 ] = P Z = Z^ (15) 

The difference the prior and "true" discretized market factor distributions thus amounts to 
different choice for weights {h^}. In the numerical examples to follows, we use a low number 
(between 10 and 20) of discretization points for each factor. 



n Note that while any given level of pairwise correlations obtained with a non-zero value of a can be reproduced 
with a — and a higher value of p, higher-order correlations (and hence tranche prices) in these two cases are 
different. 
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3.2 IMFM calibration via minimization of KL-divergence 



Our problem is to calculate the joint distribution of the 6 random variables yZ, Xi, X 2 j which 

is implied by the observed set of tranche price quotes. More exactly, we calibrate our model 
to a set of tranche expected losses at different maturities, which are obtained from market 
tranche prices using an arbitrage-free interpolation method. For the latter, one option is to 
use a spline-based method where no-arbitrage conditions are enforced as constraints, or using 
a top-down model such as BSLP [2] that does such interpolation internally. We do not expand 
on this procedure here, and instead refer the reader to the literature. 

The implied joint distribution of yZ, Xi,X 2 j is calculated as a minimal distortion (in the 

sense of information entropy, see below) of a "prior" distribution (see below for possible choices 
for the prior), with the distortion just sufficient in order to match a given set of tranche expected 
losses EL ki , where i — 1,2 stands for the credit index portfolio, and k = 1, . . . , iVj enumerates 
tranches referencing the i-th portfolio. In information theory (see e.g. [5]), the "distance" -type 
measure for two continuous distributions of a random variable Y with densities P\{y) and P2{y) 
is given by the celebrated Kullback-Leibler relative entropy (KL-divergence) 

D[P 2 \\P X \ = J dyP 2 (y)logj^ (16) 

It can be easily checked using Jensen's inequality that D[P2||Pi] is non-negative for all pairs 
(Pi,P 2 ), and reaches zero only when P2 = Pi. Note that for discrete distributions, integration 



in ( 16 ) should be substituted by summation. Having this in mind, in what follows we will keep 
for a while the continuous notation even for discrete distributions. 
In our setting, variable y appearing in (16) is 6-dimensional (6D)p^j 



Y=\Z,X h X 2 ) = [Z,Xj (17) 

where X = (XijX^) is a 4D vector storing all sub-portfolio losses in both index portfolios. 

The generalized KL-divergence for multi-dimensional distributions takes the same form as (16), 
provided integration (or summation) is extended over the whole D- dimensional space. 

In what follows we assume that we pick a particular "prior" model Q for the joint risk- neutral 

distribution of yZ,Xi,X 2 j: 

Q = Q(z, X u X 2 ) = Q (z, X) (18) 

We note that a few choices for the latter are popular among practitioners. One option is to 
use a uniform prior distribution corresponding to the state of maximum uncertainty about the 
value of Y. Another, and often more attractive choice is to use a prior measure corresponding 
to a specific parametric model. We will adopt a 2D Gaussian copula model with factor loadings 
defined as in ^ and (10) as the prior model in our approach* 
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12 Highcr dimensionality is obtained with either or both of finer portfolio partitioning or increased dimension- 
ality of the market factor. The case D = 6 seems to be the case of minimal dimensionality D needed for the 
bespoke problem. 

13 Alternatively, we could use the same model calibrated to historical data. 
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Following the Minimum Cross-Entropy Method (MCE)£j we seek an implied joint risk- 
neutral distribution P (z, X^j that minimizes the KL-divergence between the "true" distribu- 
tion P (z, Xj and the prior distribution Q yZ, X 



D 



P ( Z X i 

P (Z,X) \\Q (Z,X)\ = [ dZdXP (Z,X) log J (19) 

J Q ( Z,X 1 



(here dX = Yl i=1 2 dXi, and we use the integral notation for summation over the possible values 
of Z), subject to pricing constraints 

J dZdXp(z,X^)F ik (x} = EL ik , t = l,2, k = l... ,N i ,N i + l,N l + 2 (20) 

where for a given i, the first k = 1, . . . , iVj components of the generalized payoff functions 

F ik (^X j stand for the payoffs of the k-th tranche referencing the z-th index portfolio, while the 

last (iVj + l)-th and (JVj + 2)-th components correspond to the payoffs (total expected losses) 
for sub-portfolios II^ and IIj2, respectively: 

f {X a + X i2 -K i ^ 1 )+-(X il + X i2 -K ik )+ k = l,...,N i 
F ik (X) = I X a k = Ni + l 

{ X l2 k = Ni + 2 

where {Ki k } is a set of standard strikes for the i-th index portfolio. Generalized expected 



losses ELik are defined using a similar convention. Eqs.(20) plus the normalization condition 
J dZdXP {^Z, Xj = 1 constitute the full set of constraints imposed in our model. 

We follow the factor framework, i.e. we assume that Z is an unobservable "market" factor 
such that conditional on the value of Z = Z^, all individual defaults in both index portfolios 
IIi , n 2 become independent. Correspondingly, the joint probability of can be writ- 



ten as a product of the "market factor" probabilities (see Eq.(15)) and conditional loss 
distributions Pi (^Xi\fiCj = Pi \ Xi\Z = Z^ (i = 1, 2) of losses in two index portfolios: 

P[Z = Z A , X) = h A P (X|m) = h* JJ P l (X,|m) (21) 



8=1,2 



Similarly, we write the prior model in the factor form: 

Q [Z = Zk, X) = g*Q (X\rnj = J[ Qi (x|m) (22) 



8=1,2 



14 The MCE method generalizes the famous Maximum Entropy (MaxEnt) method. The latter is recovered 
from the former when one adopts a uniform prior. While the MaxEnt and MCE methods have been widely 
used since early 1980s in such fields as image processes, speech recognition etc. see e.g. [6], its first uses in the 
context of option modeling appeared around 1995-1996 in a series of papers by Avellaneda et al [3], Buchen 
and Kelly [I], Gulko [TU], and Stutzer [53]. For more recent applications for modeling credit portfolios, see e.g. 
Halperin [T3], Vacca [55] and Veremeyev et al [55] , 
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Substituting (21), (22) into (19), the KL-divergence can be written as follows 15 



D 



p[z,x)\\q{z,x)\ = "£ h * l °Z-+Y,Y, h * I d^iPi{^i\rh) log 



1=1.2 m 



P XAm 



Qi [Xi\m 



(23) 



The problem of minimization of the KL-divergence of distributions P [Z,Xj and Q [Z,X 



subject to constraints (20) can be solved in one of two ways. The first one is a direct functional 



minimization of KL-divergence in form (19) viewed as a functional of the joint probability 



P (m, X) = P [ZtfnX). The second, and equivalent, way is to use the expression (23) and 



minimize it jointly by viewing it as a functional of a prior independent distributions hrA and 
Pi{x\rf^. 

It is instructive to start with the second approach. We pose the variational optimization 
problem with the following Lagrangian functional: 



Urn / ^ ' 



P XAm 



Qi \Xi\rh 



hfa / dXi Pi X 4 |m F ik (X) - ELi 



(24) 



which should be minimized with respect to P\ \ Xi\m \ and Pi ^X 2 |mj. Here the first two 

terms enforce minimization of the KL-divergence between the "true" and prior joint distribu- 
tions of the market factor and losses, while the last term enforces matching the tranche pricing 
data in the least square sense ^ 

Note the relative importance of matching the data versus minimization of the KL "distance" 
to the prior distribution is controlled by parameters <7j. In particular, when these parameters 
are large, the "true" distribution is very close to the prior, while the quality of fit is poor. In 
the opposite limit Oi — > 0, one imposes an exact matching of pricing constraints. In this limit, 



the suitable Lagrangian reads, instead of (24) 



£ = ^2 h ™ lo § — + E hrA I d ^ Pi I 777 ) log 

Urn I \ ' 



P XAm 



Qi [XAm 



+ EM5>/<#fl(*i*)^ 

i.K \ m 



(X) - EL ik 



(25) 



where \ik are Lagrange multipliers. While such formulation corresponds to a more conventional 
version of the MCE/MaxEnt methods, it may potentially lead to unstable calibration and 



15 Here we switch to discrete summation over Z 

16 i.e. prior to taking into consideration constraints (20) 



17 For brevity, we have omitted additional terms in (23 1 corresponding to normalization constraints for distri 



butions of interest, however they will always be kept in mind in the calculation to follow. 
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overfitting, or even to no solution at all if constraints are incompatible. We therefore prefer 



the "soft constraint" version (24), where a compromise between robustness and quality of fit is 



expected to be achieved for some intermediate values of tTj. 



Unlike the "classical" MCE setting (25), a straightforward functional optimization of La- 



grangian (24) is diffucult because of its non-linearity. Fortunately, a simple method is available 



that allows one to get rid of these non-linearities at the price of introducing auxiliary variables. 
Following Ref.pJ], we consider the following modified Lagrangian 



£ = ^/i^log— + I dXiPi (Xi\mj lo, 



Pi IX Am 



Qi [XAm 



+ H Y* U ^ + 11 X ' k ( U ik-Y. h ™ [ d ^ Pi (^M) Fik ^ + ELik ) 

i,k Gik i,k \ rn / 



(26) 



It is easy to see that this modified Lagrangian is equivalent to the original one (24) if, in 
addition to minimization with respect to distributions h^, and Pi (Xj|m) ( i = 1,2), we also 



vary it with respect to parameters Un- and Xik- Indeed, if we first optimize (25) with respect 
to Xik, we obtain 

(27) 



V^^JiX^X^F^-EL* 



and back substitution in (26) leads to (24). The same result is obtained if we optimize wrt 
first, and Xik next. 



Having established equivalence of (26) and (24), we now want to proceed differently with 



(|26|). We start with minimization with respect to Pi (Xi|m) and P2 (X2|m). We find 

1 



Pi XAm 



Qi [Xi\fh) e^k^iF^x^-EL^) 



Zi(m, X) 



Zi(m, X) 

dXiQi (Xi\m) e ^M F ^- EL - 



1,2 



(2f 



Interestingly, Eqs.(|28|) indicate that even though we start with the factorized prior distribu- 
tions Qi \Xi\m 



- Qn (Xji|m) Q i2 (X i2 |m), we end up with non-factorizable conditional dis- 
tributions Pi ^Xj|mj due to the fact that payoff functions Fik depend on the sums Xji + X i2 

non-linearly. As discussed in details in Appendix, this both makes sense intuitively and can 
be explained within Information Theory. Therefore, while for the prior model losses in all four 
sub-portfolios Ilii,IIi2,n2i and n 22 were independent conditional on the market factor, as a 
result of calibration, they become dependent within the same parent index portfolio, while 
retaining independence across different index portfolios. 



Let us now proceed with the solution of optimization problem (24). Substituting (28) back 



in the Lagrangian (26), we obtain 



£ = ^/i^log— - J^/i^ ^ log Zi(m, A) + ^2 \ YJ U < 



ik + ^ikUik 



(29) 
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Minimizing this with respect to h^,, we find 

hm = Y(y\9fh \\ Zi(m, A) 

^ ' i=l,2 



(30) 



8=1,2 



Minimization of (29) with respect to Uq. gives 



Uik — —KkCik 



We next substitute the extremal values (30) and (31) into (29) to obtain 



C = -\ogZ(\)- 1 -J2 X >* 



(31) 



(32) 



ik 



The values of Lagrange multipliers should now be found numerically by maximization of (32), 
or equivalently, minimization of the Lagrangian 



(33) 



ik 



This is a convex optimization problem as the matrix of second derivatives is given by the 
covariance matrix of constraints 



d 2 C 



dXikX 



(F ik Fji) - (F ik )(Fji) + Sijdkia^ = cov(F ik , Fji) + <%<Wf 



(34) 



which is a positive-definite matrix. Therefore the solution is unique if it exists. 

Once the Lagrange multipliers are found, the solution of the problem is given by Eqs.(28) 
and pOj ), and the unconditional loss distribution for the bespoke portfolio is calculated using 
Eqs.gand Q. 



3.3 Pricing bespoke CDO tranches 

The analysis of the previous section applies to a given single time horizon T. To price tranches 
referencing a given bespoke portfolio, we need to pick a time grid {T^}. For each node on this 
grid, we find implied dichotomic index loss distributions for both reference indices as described 
above, and then calculate the implied loss distribution p B (x,Ti) for the bespoke as outlined 
in Sect. 2. 4, see Eq.([7]). After that, tranches on the bespoke CDO are priced using standard 
formulae that we provide here for completeness. 

Let Kd,K u be detachment/attachment points of the bespoke tranche. The term structure 
of tranche expected losses is then calculated as follows 

EL Ti = ELi = K \ J dxp B (x,T i ) [(x - K d ) + - (x - K u ) + ] (35) 
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The default leg (otherwise known as contingent leg) of the tranche is given by 



V = N f B(0, t)dEL t ~ N I + B i) ( EL i ~ EL ^) 
^° i=i 

where -8(0, t) is a risk-free discount factor. 

The premium leg (paid by the protection buyer to the protection seller) is given by 



V(S) = S-N J2&i(Bi-EN Ti - B(0,t)dENi 
~l V Jt^ x J-i- J-i-i 

M 

~ S-NoJ^^-iEN^ + ENi) 



i=l 



(36) 



(37) 



where S is the tranche spread, A, is the day count fraction, and 

ENi = EN Tt = 1 — ELi (38) 



is the expected tranche outstanding notional at time Tj. The integral term in (37) represents the 
accrued coupon due to defaults happening between the coupon payments dates. The integral 
is calculated using the standard approximation (see e.g. [19]) that amounts to substitution of 
(t — Tj_i)/(Tj — Tj_i) and B(0,t) by 1/2 and Bi, respectively. The fair (break-even) tranche 
par spread, S, is determined from the par equation T> = V(S). 



3.4 Discussion 

3.4.1 Inter-temporal consistency and time arbitrage 

The framework presented so far corresponds to what is known as "single-period" models in the 
literature. In this approach, one deals only with marginal loss distributions at a given set of 
maturities, but not with joint probabilities of losses at different time horizons. Note that the 
knowledge of marginal loss distributions is sufficient for pricing of CDO tranches. Reversing 
this argument, one can state that market prices of index tranches contain information on 
marginal loss distributions but not on joint inter-temporal distributions, see e.g. [2] for a 
related discussion. 

In the above framework, we treat different time horizons separately from each other, and 
calculate the implied distribution that matches a set of tranche expected losses (ELs) for each 
node Tj on a time grid. If the set of input tranche ELs is arbitrage-free across both strikes 
and time, then the resulting implied distribution will be free of arbitrage across time for loss 
levels corresponding to strikes in the calibration set. However, no-arbitrage across time is not 
guaranteed in this approach for other loss levels. 

While we have not encountered violations of time arbitrage in practice with our numerical 
experiments, the above implies that it may happen. Note that by continuity, once we calibrate 
to an arbitrage-free set of tranche ELs, the time no-arbitrage holds not only for reference 
strikes, but also for loss values around these strikes. Therefore, a simple practical way to 
prevent volation of time arbitrage is to increase the number of strikes in the calibration set. 
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A more principled approach to the problem of possible time arbitrage requires switching 
to a dynamic framework. Similar to the way no-arbitrage across strikes is guaranteed by the 
properties of the KL-divergence (see above in Sec.2.1), no-arbitrage across time is ensured once 
we pick an arbitrage-free prior model for transition probabilities. Such a construction will be 
presented in Sect. 4. 



3.4.2 What form of entropy minimization should one use? 

A couple of further comments are in order here. First, note that Eqs.(28) and (30) imply the 
following form of the "true" ("posterior") joint distribution P (jn, Xj: 



P X,Z, 



hr 



XAm 



1 



=1,2 



Z{\) 



Q[X,Z rn )e 



(39) 



which could equivalently be obtained by a direct minimization of the KL-divergence in the form 



(19), subject to constraints (20). This would provide a much shorter derivation of our final 



result (28), (30). This is required, of course, for self-consistency of the method. 



Second, the lengthy derivation leading to our equations (28), (30) was given above with the 



intent to illustrate the modelling freedom in the implied loss approaches which have recently 
become popular in the literature on credit portfolio modeling. Assume for the moment that 
instead of independent variation of (23) with respect to both market factor distribution and 

conditional loss distributions Pa 



Xi\mj, we would fix the functional form of, say, the latter, 

and only optimize with respect to the former (see e.g. in Rosen and Saunders [20J ) . It is clear 
from the above derivation that such a procedure would yield a sub-optimal (in the sense of 
KL-divergence) solution for the joint distribution of the market factor and sub-portfolio losses. 

The latter point is easy to illustrate a bit more formally. Assume, for the sake of argument, 
that we calibrate to a set of tranche expected losses for synthetic first loss (equity) tranches with 
payoff functions F^(x) = min {x,K^). Assume that we fix the conditional loss distributions to 

their prior form, Pi yXi\rfij = Qi yXi\rnj. In this case, minimization of the Lagrangian (24) 

with respect to yields 



Z(X) 



9rne 



■E« (/ dXiQi (Xi\nCj Fik(Xi) — E Lib*) 



(40) 



where Z(X) is a normalization factor, and Lagrange multipliers A^ minimize the Lagrangian 
function 



t" = logZ(A) = log J] e -lW/«tf«< 

V m £=1,2 



[XilrtAFikiXJ-ELi, 



(41) 



Comparing this Lagrangian to (33), we note that in the limit (7j — > 0, they have the same 



gradients and hence reach their minima at the same point A = A*. On the other hand, as long 
as functions e~ AifejFifc(l ) are convex in x, by Jensen's inequality we have, for any fixed vector A: 



-Eik>>ik F ik(Xi) < p-llik^ikSdXiQ^Xi^FiktXi) 



(42) 
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which implies that the minimum of C lies lower than the minimum of C" . This exactly means 



sub-optimality of the solution (40) in terms of KL-distance between the "true" and "prior" 
joint distributions of (Xi,mj. 

Finally, note that even though it might appear that allowing for independent variations of 
hfn and P [Xi\rh) brings "too much flexibility" to the problem, this is not true: adjustments to 



both htft and P [Xi\rh) are driven by the same set of Lagrange multipliers, and are equivalent 



to a single adjustment of the joint probability P ( Z^, X ) , see (39 ). In the nomenclature of fac- 



tor models, this means that the least biased distortion of a prior joint distribution Q yX, Z, 

is obtained when both the market factor distribution and conditional loss distribution are al- 
lowed to vary. The joint optimization of distribution Q (^X, Z^j has the same complexity as 

optimization of the market factor distribution alone. 

4 IMFM: a dynamic multi-period formulation 

Here we present a dynamic generalization of the formalism of the previous section to a multi- 



is 



period setting. We consider a (sufficiently dense) set of reference maturities T , T l5 . . . , 7at t -i 
We set T_i = to be today's time. Let 

X? = (X a (T n ),X l2 (T n )) , i = l,2,n = 0,...,N T -l (43) 

be the loss in the i-th portfolio at time t = T n , expressed as a tuple of losses in the "relevant" 
and complement sub-portfolios at the same time, respectively. Alternatively, we use a dynamic 
4D vector 

X n =(x 1 (T n ),X 2 (T n )) (44) 



to specify the joint sub-portfolio losses in the two index portfolios at time T n . 

We assume a dynamic framework for the 2D common market factor Z t = (Z±(t), Z 2 (t)), 



where the (time-dependent) components Zi(t), Z 2 (t) take values on the same grid (13). Fur- 
thermore, we assume that Z t is a right-continuous jump process with jumps allowed only at 
times {T n }: 

Z t = Z n = (Zx(T n ), Z 2 {T n )) , t E [T„_ 1; T n [ , n = 0, 1, . . . , N T - 1 (45) 



Similar to (15), we introduce the time- dependent integer- valued 2D vector m n that labels 
discrete states of the market factor components on the interval [T n _i,T n [: 

hrh» = P (z(t) = Zftn) = P (Zi(t) = z mi ,Z 2 (t) = z m2 ) , t e [T n , T n+ i[ , n = 0, 1, . . . , N T - 1 

(46) 

We assume a Markovian setting in our model, i.e. the state variables at time T n depend only 
on state variables at previous time T n _i but not on their prior history: 

P (x n , m n \X ..., X n ~\ m°, . . . , m™- 1 ) = P (x n , m"^ 1 .^ 1 ) (47) 



18 In practice, maturities {T n } may be taken e.g. with annual steps. 
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We now construct a multi-period generalization of the single-period setting of the previous 
section, where one infers transition probabilities P (^X n , m n \X n ~ 1 , m"" 1 J instead of marginal 

probabilities P ^X n ,m n j. The model is constructed (equivalently, calibrated) within a boot- 
strap procedure in the time dimension, starting with the first maturity To. In treating the time 
dimension of the problem, our approach is similar to that developed in Ref. [IT] in a somewhat 
different context of pricing equity derivatives. 



4.1 The first maturity Tq 

For the first maturity To, we assume that today (t = 0, or equivalently n = —1) we have no 
losses in either portfolio, and the value of the market factor prior to time T_i is fixed somehow 
(this value is irrelevant for calculation of the joint law of X(t),m(t) at time t = T ). The 

marginal joint distribution P ^X°,m°j is then calculated using the formalism of the previous 



section, see Eq.(39) 



(4? 



where Q^yZ^o, X°J is the prior joint distribution of the state variables at time T . 
4.2 Further maturities T n , n = 1, . . . , Nt 

Now we assume that we have calculated the implied joint distribution for the previous maturity 
T n _i, and we want to move one step on the time grid, i.e. calculate the transition probabilities 

P (^X n ,m n \X n ~ 1 ,rn n ~ 1 ^j. The latter should be calculated (calibrated) in a way that ensures 
consistency with pricing data for maturity T\ as seen today, at time t = 0. 

Our method to calculate transition probabilities P (^X n , m n \X n ~ 1 , m n_1 ^ is based on mini- 
mizing of a suitable KL-divergence, similar to the procedure used above in the single-period set- 
ting. The only difference from the previous case is that now we have to minimize the conditional 
cross entropy of transition probabilities P (^X n , fh n \X n ~ l , m n_1 j and Q (^X n , rh n \X n ~ 1 , m n_1 j . 
The latter is defined as follows (see e.g. [6]): 



D[P\\Q)= ! dX n ~ l dX n P ^X n ,m n ,X n -\m n ~ 1 ^j log— j 



P [X n M n \X n -\m n - 1 



Q [X n ,m n \X n - 1 ,m n - 1 



(49) 



which can be interpreted as the KL-divergence of transition probabilities P (^X n , rh n \X n 1 ,m n 1 
and Q (^X n , m n \X n ~ l , fn n ~ 1 j viewed as a function of initial values X n ~ 1 ,m n ~ 1 , and averaged 
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over these initial values using the marginal joint probability distribution P [^X n 1 , m n 1 
culated at the previous time step: 

D [P\\Q] = I dX^P (X n ~\m n -A D^"- 1 ,^ 1 ) 
Introducing transition probabilities for the market factors 



h (m n \m n - 1 ,X n ^ 1 '\ = P 



Z — Zffan I Z 



n-1 



n-1 



and the corresponding prior probabilities 



cal- 



(50) 



(51) 



g[m n \m n -\X n - 1 



p(prior) 



Z — Zry, n Z 



n-1 



Ztfjn-1 , X 



n— 1 



(52) 



the conditional KL-divergence (50) can be put in a more suggestive form (compare with (23)): 



D[P\\Q] = Yl / d * n ~ lp 



X n ~\m n - 



1 J2 h (m n \m n -\X n - 1 ) l(u 



h (m n |m n - 1 ,X n - 1 



g ( m n \m n - 1 , 1"" 1 



P [X^rh 71 ^ 11 - 1 ,™ 71 - 1 



(53) 



+ / dX n P ( X n | m n ,X n ~\m n - 1 ) log - 

Our pricing constraints now take the following form: 

[ dX n - x P (X"- 1 ,^" 1 ) [ d * n P (X",^"!^™- 1 ,^"- 1 ) F ik (X n ) = ELI (54) 

While the problem of minimization of (53) subject to constraints (54) admit more general 
priors, in what follows we restrict ourselves to a particular sort of priors. First we assume that 
prior transition probabilities for the market factor depend only on the previous value of the 
market factor but not on the previous loss level: 



g [m n \m n -\X n - 1 



9( 



n I -?n— 1 



m \m 



) 



(55) 



A possible candidate for the prior market factor transition matrix can be a simple birth-and- 
death discrete-time Markov chain. 

Second, we assume that the probability of reaching loss X n by the end of period [T n _i,T n [ 
depends only on previous losses X n ~ x and the value fh n of the market factor on this interval, 
but not on its previous value: 



Qi [X^\m n ,X^\m n -' 



Qi (A7|m",A7- J 



(56) 



Within such reduced class of admissible priors, for the latter we can use simple single-period 
models such as a two-factor Gaussian copula (or the RFL model) that we used above in a static 
setting of Sect. 3. 
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A brief inspection of Eqs.(53) and (54) establishes that the results of a corresponding La- 



grangian optimization problem of minimization of (53) subject to constraints (54) can be im- 



mediately read off the previous formulae for a single-period case, provided all marginal prob- 
abilities should substituted by transition probabilities of transition from the previous state 

^X^ 1 , Z n ~ l \ . This observation allows one to immediately write down the result (which can of 

course be readily checked following steps similar to those described above for the single-period 
setting): 



1 



ZK \rh n ,X] 



n-l 



h 



X 



n-l 



Zl (m n -\X n - 1 



j dX?Q t (x^wP^Xr 1 ) e^MW")-^ fe ) 

Zl(X"-\m^) iti,2 V 

(m-lm- 1 ) (Xr\m n ) Z n 2X (x^\ 



(57) 



n-l -n 



Note that the first of Eqs.(57) shows that our framework is arbitrage-free as long as the prior 
model is, irrespective of the model g (rh n \rh n ~ 1 ) for the dynamic market factor. Furthermore, 
the third of Eqs.(57) demonstrates that our model produces credit contagion: even though we 
started with a prior transition probabilities g (m n |m n_1 ) that were independent of the previous 

loss levels X" -1 , the "true" transition probabilities h {m n \fh n ~ 1 , X n ~ l j do depend on them. 

Finally, note that Eqs.(57) can also be combined into a formula for joint transition probabilities 



P {m n ,X n \m n ~\X n - 1 



— 1 (m n ,X n \m n -\X n - 1 ) e^ x ^ F ^~ EL ^ 

^\m n - 1 ,X n - 1 ) V J 



(5f 

Here the values of Lagrange multipliers Xik should be substituted by the solution of a (N\ + 
N 2 + 4)-dimensional convex optimization problem with the Lagrangian function (compare with 



Eq.(33)) 



m™- 1 ik 



\2 2 
A ik°ik 



(59) 



4.3 Completing one step of time bootstrap 

Having computed the transition probability P 



X n ,m n \X n -\m n - 1 



and using the marginal 



probability P 



X n-1 rfn-l 



known from the previous time step, we know use them to calculate 



the marginal joint distribution of state variables at time 7*.: 

P[ni".X") = I 'IX'-tp {iiY'-^.X"- 1 ) P (m".. X^nY'-^X"- 1 



(60) 
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Unless we reached the final maturity T^ k on the time grid, we know increment k — > k + 1, 
coming up with a problem identical to that just solved for the previous period, where (60) 
plays the role of the marginal joint distribution from the previous step. 



5 Generalizations 

5.1 More market factors 

Formally generalization to more than two market factors is straightforward and follows the same 
lines as developed above. However, for higher number of factors (three and more?) Monte Carlo 
may be preferred for integration over market factor distribution as well as for calculation of 
conditional loss distributions. 

5.2 More index chunks in the bespoke 

Assume that we have to price a tranche on a bespoke composed of three chunks belonging 
to three different indices {ii}f =1 . Such calculation would proceed along the same line as de- 
scribed above, with a simultaneous calibration to tranches referencing all indices Jj, while the 
conditional loss distribution ^ would now be given by a two-dimensional convolution instead 
of a one-dimensional one. Note that for this case a three-factor version of the model may be 
preferred to a two-factor one. 

5.3 Bespoke portfolios with "bespoke" names 

So far, we have assumed that each name in a bespoke portfolio belongs to some credit index. 
In practice, this is often not the majority of actual bespoke portfolios typically have 

some "bespoke" names, i.e. names that do not belong to any credit index. 

Let us assume that we are given a bespoke portfolio of the following form: U B = n n U II 21 , 
where IIn stands for a sub-set of names from an index portfolio I\ (e.g. CDX.NA.IG) and 
H21 is a sub-set of "bespoke" names that do not belong in any credit index. For example, the 
sub-set H21 can be composed of non-investment grade US names. In this case, we would take 
the CDX.NA.HY portfolio is the closest second benchmark security to our bespoke set. 

More generally, assume that names in H 21 are similar in terms of their geographic sectors 
and ratings to names in another index J 2 . We can then pose (and solve) the problem of 
finding a sub-set U21 of names from J 2 that provides the best approximation of the set U21 
in terms of sector diversification, average expected loss, and possibly higher moments of the 
loss distribution. This is a combinatorial optimization problem that can be solved e.g. using 
a genetic algorithm (details will be given elsewhere). Once such a proxy index sub-portfolio 
is found, our problem of pricing bespoke tranches can be reduced to the previous problem of 
pricing tranches on a 72-type bespoke portfolio. 

The approximating sub-portfolio U21 is generally expected to have an average expected loss 
(EL) that is not exactly equal to the EL of the original set n 2 i. This could lead to a mispricing 
of bespoke tranches. However, this can be easily cured by adjusting the loss distribution found 
for the sub-portfolio n 2 i to match the expected loss of n 2 i- To this end, we use the MCE 
method once again. 



24 



Let P (X\m) be the "true" conditional loss distribution of the set H21, and EL be its 
total expected loss. Further let Q (X\m) and {h^} be the conditional loss distribution of the 
approximating sub-portfolio H21 and the implied market factor distribution, respectively. We 
assume that Q (X\m) and {h^} are calculated as described in Sect. 3. We assume that the 
net difference between loss distribution of sets n 21 and n 2 i amounts to an adjustment of the 
conditional loss distribution, while keeping the market factor distribution intact. This leads to 
a variational problem with the following Lagrangian function 



£ = X>™ J dX P (X\m) log P +Af^>„ J dXXP(X\m)-EL 



(61) 



where A is a Lagrange multiplier enforcing the expected loss constraint. Minimizing this wrt 
P (X\m), we obtain 

P (X\m) = -±—Q (X\m) e"^"^ (62) 
where Z^(X) is a normalization factor: 

Z A {\) = J dXQ (X\m) e - x{x - EL) (63) 

By substituting (62) back into (61) and changing the sign £ — > —C , we obtain 

£ = ^/i* log ^(A) (64) 



The value of A should be found by minimization of (64). As before, this is a convex problem, 
therefore it has a unique solution that can be obtained in about a second. 

To summarize, the procedure of calibrating a model for a bespoke portfolio that has both 
names from an index portfolio I\ and names similar (but not identical) to some names from an 
index portfolio I2 amounts to three separate and consecutive optimizations: 

• Find an optimal sub-portfolio n 2 i of portfolio 7 2 that approximates the "bespoke" set of 
names n 2 i in the bespoke portfolio 

• Use the MCE procedure of Sect. 4 to calculate implied loss distributions in index sub- 
portfolios 



Perform one-dimensional minimization of function (64) to find the optimal value A = A* 



This value is then used in (62) to calculate an adjusted conditional loss distribution for 



the approximating sub-portfolio. 
After that, bespoke tranches can be priced following steps described in Sect. 2. 4. 



6 Numerical examples 

Here we illustrate performance of the model and the difference of the resulting prices from 
those obtained with the Base Correlation method using two sets of artificial bespoke portfolios, 
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Figure 3: Tranche quotes for CDX.NA.IG11 on 04/15/09. 



both made of names that belong in index portfolios. We will only present results obtained with 
a pseudo-dynamic version of the model described in Sect. 3, results obtained with a dynamic 
version will be reported elsewhere. 

Our first example deals with a bespoke portfolio composed of 50 highest spread names from 
CDX.NA.IG11 and 50 highest spread names from iTraxx Europe 9 index portfolios. The pricing 
date is 04/15/2009. Quotes for tranches on CDX.NA.IG11 and iTraxx Europe S9 indices are 
shown in Fig. [3] and Fig. |4| respectively. 

We calibrate the model to a set of tranche expected losses obtained using BSLP model 
of Ref. [2] (Results obtained using calibration to base correlation quotes will be reported be- 
low.). Calibration results are shown in Tables [l] and [2j where we display relative errors 
between the calculated and input tranche ELs obtained in the joint calibration of the model 
to CDX.NA.IG11 and iTraxx Europe S9 data. Note that as losses in senior tranches at short 
maturities are very small, large relative errors observed for these tranches give rise to very small 
pricing error in terms of tranche par spreads, thus rendering calibration nearly perfect. The 
implied distribution of the market factor is shown in Fig. [5j 



26 



Lower 


Upper 




Upfront 




Strike 


Strike 


Matu rity 


Payment 


Fee (bps) 


09b 


3 009b 


20-J111-1 1 


51 999b 


500 OO 


3 O0% 


6 Q09b 


20-J111-II 


16 35% 


500 OO 


6 009b 


9 009b 


20-Jui-l 1 


3 249b 


500 OO 


9 00% 


12 009b 


Oil 1. n -f <* 

mm IM I I 1 


O 009b 


31 1 OO 


12 009b 


00% 


j' * ■ 1 U I ■ 1 


O 009b 


106 OO 


?y 00% 




C>TI _li 1 1 


O 009b 


56 OO 


09b 




OO— _ll Bl-1 -I 

■ * " 1 j i_m ■ ■ 1 


O 009b 


205 OO 




3 00% 


20-Jui-13 


62 509b 


500 OO 


3 00% 


6 009b 


20-Jitv13 


24 509b 


500 OO 


6 O09b 


9 009b 


20-Jliv13 


5 8B9b 


500 OO 


9 00% 


12 009b 


Oil 1. n 1 "J 


O 009b 


346 OO 


12 CM)*K> 


009b 




O 009b 


136 OO 


99 00% 




1 «J \ MM I I J" 


O 009b 


60 OO 


0% 


IOO 00% 


20-JU1-13 


O 00% 


ISO 00 


0% 


3 00% 


20-Jui-15 


65 75% 


500 OO 


3 QO% 


6 009b 


20-Jitv15 


31 25% 


500 OO 


6 009b 


9 009b 


I in-1 *i 


IO 389b 


500 OO 


9 00% 


12 CKJ9b 


^ j ■ 1 1 ■ ■ - > 


O 009b 


378 OO 


12_QO% 


22_00% 


>t) .lift 1f> 


O 00% 


149 OO 


22.009fc 


100.00% 


20 1'> 


0.00% 


59.00 


0% 


100.00% 


20-Jim-15 


0.00% 


1 66.OO 


0% 


3_00% 


20-JW-1S 


TO_25% 


500.00 


3.00% 


6.00% 


20-JLB1-18 


36.63% 


500.00 


0.00% 


9.00% 


20-JW-18 


15.25% 


500.00 


900% 


12 00% 


20-JW-18 


000% 


44600 


12_00% 


22 00% 


.llMl IK 


000% 


161 .OO 


22.009fc 


100.00% 


2)KJuv18 


0.00% 


65.00 


0% 


100.00% 




0.00% 


161 .OO 



Figure 4: Tranche quotes for iTraxx Europe S9 on 04/15/09. 
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1Y 


-0.010 
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- 0.024 


- 0.040 


- 0.468 


0.005 


- 0.004 


3Y 


-0.000 


- 0.002 


0.000 


0.003 


0.003 
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0.0163 


0.006 


0.018 
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5Y 
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Table 1: Relative errors ^ (in percent) between input ELs for CDX.NA.IG11 and 

calculated model values obtained with joint calibration to CDX.NA.IG11 and iTraxx Europe 9 
data on 04/15/2009. 
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Implied market factor distribution 




Figure 5: Implied distribution of the market factor on 04/15/09, calibration to CDX.NA.IG11 
and iTraxx Europe S9. 
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Table 2: Relative errors — msM. — . mssi ([ n percent) between input ELs for iTraxx Europe 9 and 
calculated model values obtained with joint calibration to CDX.NA.IG11 and iTraxx Europe 9 
data on 04/15/2009. 
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Tranche Base correlation IMFM, case 1 IMFM, case 2 
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7% 
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0.649 
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3.948 
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0.482 


1229.6 


3.835 
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329.65 


4.507 


0.149 


409.2 


4.524 


0.185 


403.4 


4.538 


0.183 


30% 


100% 


130.15 


4.407 


0.057 


63.4 


4.786 


0.030 


63.6 


4.785 


0.031 



Table 3: Pricing of tranches on the mixed CDX IGll-iTraxx9 bespoke portfolio with 5Y to 
maturity on 04/15/09. Case 1: p = 0.5, a = 0.3. Case 2: p = 0.75, a = 0.0. 
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Figure 6: Equivalent correlation skew for the mixed CDX IGll-iTraxx9 bespoke portfolio priced 
on 04/15/09. 




The result of pricing tranches referencing our bespoke portfolio are shown in Table [3] where 
strikes are chosen to coincide with standard strikes of the CDX.NA.IG portfolio. We show the 
results obtained with two sets of correlation parameters: for Case 1, we use p = 0.5, a = 0.3, 
and for Case 2, we set p = 0.75, a = 0.0. Note that inter-sector correlations are decreased in 



the second case by 10% relatively to the first one, see Eq.(12). Therefore, the effect of transition 
from Case 1 to Case 2 is as expected: junior spreads go up, while senior spreads go down^ 
On the other hand, for both choices of correlation parameters we find that senior mezzanine 
and junior senior tranches are substantially more expensive in our model than in the Base 
correlation approach. We can also convert prices into equivalent base correlations, see Fig. 161 



19 Note that, as we enforce portfolio loss constraints together with tranche expected loss constraints, we do not 
include super-senior tranches in our calibration set in order to avoid linear dependencies between constraints 
(which would follow otherwise as the sum of all tranche losses should equal the portfolio loss). As a result, the 
impact of changed inter-sector correlation on the price of a super-senior tranche is less pronounced. 
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Tranche Base correlation IMFM, case 1 IMFM, case 2 
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Table 4: Pricing of tranches on the mixed CDX IGll-iTraxx9 bespoke portfolio with 5Y to 
maturity on 04/15/09. Case 1: calibration to ELs generated by Base correlation model. Case 
2: calibration to ELs generated by BSLP model. For both cases, p = 0.75 and a = 0. 
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Table 5: Relative errors ^ , (in percent) between input ELs for CDX.NA.IG11 and 
calculated model values obtained with joint calibration to CDX.NA.IG11 and CDX.NA.HY10 
data on 05/15/2009. 

To test the impact of different interpolation schemes used to calculate tranche expected 
losses on the time grid, we have performed comparison of results obtained with calibration of 
IMFM to tranche ELs obtained with the Base correlation (BC) model instead of ELs generated 
by BSLP. The results are shown in Table |4} It turns out that calibration to BC-generated 
tranche ELs produces larger pricing errors for short time maturities comparing to calibration 
to BSLP-generated ELs. The resulting bespoke tranche prices are however reasonably close to 
numbers obtained with the former method (we skip the results to save space). 

Our second bespoke portfolio is a mixed portfolio of 125 US investment and non-investment 
grade names, priced on 05/15/09. It has 90 highest spread names from CDX.NA.IG11, while 
35 lowest spread names from this index are substituted by 35 lowest spread names from 
CDX.NA.HY10 index. Market quotes on index tranches are shown in Figs. [7] and [8] 

Calibration results are shown in Tables [5] and |6j where we display relative errors be- 
tween the calculated and input tranche ELs obtained in the joint calibration of the model to 
CDX.NA.IG11 and CDX.NA.HY10 data. Similar to the above, large relative errors observed 
at short maturities do not have a material impact on the quality of calibration because losses in 
senior tranches at short maturities are very small anyway. The implied distribution of the mar- 
ket factor for this case is shown in Fig. [9j Unlike previous case, we now observe a pronounced 
bi-modal shape of the implied distribution. 
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Figure 7: Tranche quotes for CDX.NA.IG11 on 05/15/09. 
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Table 6: Relative errors m fl (in percent) between input ELs for CDX.NA.HY10 and 

calculated model values obtained with joint calibration to CDX.NA.IG11 and CDX.NA.HY10 
data on 05/15/2009. 



31 



Lowe r 


U ppe r 




Upfront 




Strike 


Strike 


Matu rity 


Payment (%) 


Fee (bps) 


o% 


3.0O9b 


20-Jl*l-11 


99.00% 


O.OO 


3 00% 


8 449b 


20JWV11 


9350% 


O OO 


8 44% 


1390% 


20JUI-11 


7421% 


500 OO 


1390% 


1931% 


^<J 1 1 


43 79% 


500 OO 


19.31% 


24.759b 


20J«v11 


25.19% 


500.00 


24.75% 


30.209b 


20-JIM1-11 


10.81% 


500.00 


30 20% 


100 00% 


20JWV11 


O OD% 


322 50 


0% 


3.00% 


syi .u»i i s 


99.00% 


O.OO 


3.00% 


8.44% 


20-Jl*l-13 


98.38% 


O.OO 


8.449b 


13.90% 


2QJl«v13 


78.29% 


500.00 


1390% 


1931% 


20JUI-13 


54 71% 


500 OO 


1931% 


24 75% 


^<J Jim 1 ( 


40 28% 


500 OO 


24.75% 


30.20% 


syi .u»i i s 


25.22% 


500.00 


30.209b 


100.00% 


20-Jl*l-13 


0.00% 


477.50 


0% 


3 00% 


20JUI-15 


99 38% 


O OO 


3 00% 


8 44% 


20JUI-15 


97 75% 


O OO 


8 449b 


13 ao% 


20 Mi 


7520% 


500 OO 


13.90% 


19.31% 


20 Jim 1'» 


60.30% 


500.00 


19.319b 


24.75% 


20-Jl*l-15 


51.25% 


500.00 


24 759b 


30 20% 


20JUV15 


31 00% 


500 OO 


3D 20% 


100 00% 


2DJUV15 


O 00% 


459 OO 


09b 


3 00% 


^<J Jim 1!i 


99 50% 


O OO 


3.009b 


8.44% 


2QJl«v18 


98.73% 


O.OO 


8.449b 


13.90% 


20-Jim-18 


82.50% 


500.00 


1390% 


1931% 


20JUV18 


76 25% 


500 OO 


19.319b 


24.75% 


syi jim iM 


72.94% 


500.00 


24.759b 


30.20% 


20-Jim-18 


38.81% 


500.00 


30 20% 


100 00% 


20Jm-18 


O OD% 


617 SO 



Figure 8: Tranche quotes for CDX.NA.HY10 on 05/15/09. 
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Figure 9: Implied distribution of the market factor on 05/15/09, calibration to CDX.NA.IG11 
and CDX.NA.HY10. 
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Table 7: Pricing of tranches on the mixed CDX IG11-CDX HY10 bespoke portfolio with 5Y to 
maturity on 05/15/09. Case hp — 0.5, a = 0.3. Case 2: p = 0.75, a = 0.0. 



Results of pricing tranches on this portfolio with maturity of 5Y are shown in Table [7j 
Note that qualitatively the difference in resulting numbers between IMFM and BC is similar 
to results found for our previous example of a mixed CDX.IG-iTraxx bespoke portfolio, i.e. the 
largest discrepancy between the two models is obtained for senior mezzanine and junior senior 



tranches. Equivalent correlation skew in shown in Fig. 10 



7 Conclusion 

In this paper we have presented the Implied Multi-Factor model (IMFM) - a semi-parametric 
hybrid bottom-up/top-down model designed for pricing and risk management of tranches on 
bespoke portfolios, as well as other, more exotic derivatives referencing bespoke portfolios. The 
model ensures no-arbitrage in the bespoke pricing, and eliminates the need for ad hoc "base 
correlation mapping rules" that are often used by practitioners to price bespoke CDO tranches. 
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Figure 10: Equivalent correlation skew for the mixed CDX IG11-CDX HY10 bespoke portfolio 
priced on 05/15/09. 



The alternative suggested in this paper is to use instead standard tools of probability theory 
in combination with the Minimum Cross Entropy (MCE) method of statistical inference. The 
latter is employed to infer, an a least biased way, loss distributions of index portfolio implied 
by available prices of tranches referencing these portfolios. 

Our framework differs from most of more traditional models in a number of points. First, our 
approach is semi-parametric, allowing one to automatically adjust the number of free parame- 
ters (and calibrate them) once the size of calibration set is changed. This produces a flexible 
and accurate calibration. In addition, this approach can be easily adjusted to accomodate e.g. 
quotes on sub-portfolios of index portfolios (or even tranches referencing bespoke portfolios) if 
traders are willing to provide such quotes. Second, the model is able to control the relative 
importance of fitting the data versus proximity to the "prior" model, thus providing means 
to avoid possible overfitting, or to find a solution in sutuations where constraints cannot be 
satisfied exactly. Third, calibration in our IMFM framework amounts to convex optimization 
in a low dimensional space. Therefore, the model is computationally efficient and calibrates 
within minutes rather than hours which we would expect for a more traditional bottom-up 
model with a multi-factor structure. 

Last but not least, convexity of the calibration problem in our model leads to a unique 
solution. Recall that more traditional bottom-up approaches to modeling credit portfolios 
typically give rise to objective functions having multiple local minima. When the model is 
re-calibrated from one day to another, the presence of local minima poses a problem, as a local 
search algorithm typically used for calibration might end up in a different local minimum from 
that found a day earlier, leading to unstable calibration and hedge ratios. Our model is free of 
such potential pitfalls. 

We have formulated two versions of the model. The first, single-period ("static") version 
can be used for pricing bespoke CDO tranches. The second, dynamic version of the model can 
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be used to price and risk-manage other, more exotic portfolio derivatives, such as e.g. forward- 
starting tranches or tranche options. Another potential application for the dynamic version of 
our model is the counterparty risk management (including, in particular, CVA calculations) for 
CDO tranches. Because the model is low-dimensional and Markovian, efficient lattice or tree 
implementations are possible. 

The approach presented in this paper is readily generalizable and extensible. Even though 
we presented details and numerical examples only for the basic case involving two market 
factors and two reference index portfolios for a given bespoke portfolio, we have shown how to 
generalize this setting to more complex cases including more reference portfolio, more market 
factors, and handling names not belonging in any index portfolio. Further extensions including 
risk management of bespoke tranches and modeling of price uncertainty of bespoke tranches 
due to bid/ask spreads of index tranches will be presented elsewhere. 



Appendix: Mutual information and portfolio loss parti- 
tioning 



As was mentioned above, the solution (28) indicates that the conditional independence of losses 
in sub-portfolios does not hold for the 11 posterior 11 model that adjust the "prior" distributions 
so that pricing constraints referencing portfolios as a whole are respected. In this section, we 
discuss this phenomenon in more details. 

We start with an intuitive explanation. Assume we have two subportfolios rin and II12 of 
an index portfolio III. whose losses X = Xu and Y = A 12 are assumed to be independent a 
priori. Obviously, if we know the loss L in the whole portfolio with certainty, L = L , then 
instead of independence of X and Y we have a deterministic dependence between them as now 
Y = Lq—X. By continuity, we should expect that when instead of knowing losses with certainty 
we only know the expectation of L (as L should generally be treated as a random variable), we 
should expect a probabilistic dependence between X and Y, because the previous deterministic 
case would be recovered in the limit when the loss distribution is a delta function centered at 
Lq: (L) = J dL Lp(L) — > J dL L5(L — Lq) = Lq. This implies that the more information (less 
uncertainty) we have about the distribution of L, the more informative X and Y should be 
about each other. 

Let us make this intuitive argument a bit more formal. A model-free information-theoretic 
measure of dependence between two random variables X and Y is the mutual information 

p(x,y) f , , / / 1 \ -i p(y\x) 



I(X;Y) = E p{X:V) 



\ P(X,Y) 
° g p(X)p(Y) 



dxdyp(x, y) log p ^ p ^ = J dxdy p(x)p{y\x) log 



p(y) 



dxdyp(x)p(y\x) log j^(^7 (B-1) 

In particular, it follows by Jensen's inequality that I(X; Y) > 0, while the equality is reached 
iff the two variables are independent, i.e. p(x,y) = p(x)p(y), see e.g. [6J. It is easy to check 



that unconstrained minimization of (B.l) with respect to p(y\x) yields a factorized solution. 



Indeed, the variational derivative of ( |B.l j reads 



SI 1 \ 1 p(y\ x ) ( m P(x,y) , . 

p(x) log — -— = p(x) log — t\— TT (B-2) 



5p(y\x) p(y) p(x)p(y) 



35 



which vanishes when p(y\x) = p(y) or equivalently p(x,y) = p(x)p(y). 

Let us now assume we are given constraints in the form of expectations (Fi(x + y)) p { x ,y) = 
C{. The minimum mutual information consistent with these constraints should be found by 
minimization of the following Lagrangian 

C = I(X-Y)-Y J ^(yf dxdyp{x)p{y\x)F i {x + y)-c}j - J dx^) {^J dyp{y\x)-l 

(B.3) 

where Aj and £(x) are Lagrange multipliers. This variational problem has the formal solution 

p{y\x) = p(y)e^< ^ F ^+v)+^) (B.4) 

While this solution is only a formal one as p(y) in turn depends on p(y\x) by the marginaliza- 
tion condition p(y) = J dx p{x)p{y\x) 20 it clearly shows that unless all \ = 0, the conditional 
probability p(y\x) does depend on x (as long as functions F,i{x) are non-linear), i.e. the in- 
dependence solution p(x, y) = p(x)p(y) cannot hold anymore. Physically, this means that the 
knowledge of expectations of functions of the form Fi(X + Y) carries information on interac- 
tion between two sub-portfolios (i.e. has a non- vanishing information value), and is in general 
incompatible with the independence assumption^} 

In our setting involving the market factor Z = described by a set of discretized prob- 
abilities {h^}, another relevant object is the conditional mutual information of sub-portfolio 
losses X = Xn and Y = X%\ given Z^. 

P(x, y\rh) 



I{X\Y\Z) = E 



, P(X,Y\Z) 
° g p(X\Z)p(Y\Z) 



^2 ha, J dxdyP(x,y\m) log 



Px{x\m)PY(y\m) 



J2 h ™ I dxdyPx(x\m)P Y (y\x,m)\og ^^j^- t ^, - (B.5) 

L — ' / I Fx\x \m)rY\y\x\m)dx' 

m 



The same calculation with (B.5) results in factorization of conditional probabilities P(x, y\m) = 
Px{x\m)PY(y\m), i.e. in conditional independence. 

Coming back to the unconstrained case, let us continue with the example of two random 
variables X and Y discussed above, which will now be identified with losses in sub-portfolios 
Tlx and Ily, respectively. We assume that X and Y are independent a priori, which is the 
case when we condition on the market factor Z^\ As we have seen above, unconstrained 
minimization of mutual information (MI) I(X; Y) yields the factorization (independence) result 
p(x,y) = p(x)p(y), but what should we take for marginal distributions p(x) and p(y)7 Clearly, 
if we have some initial guesses (e.g. obtained with historical calibration) q(x) and q(y) for 
the marginals, it is natural to expect that in the absence of any constraints we should have 
p(x) = q(x), p(y) = q(y) and p(x,y) = p(x)p(y) = q(x)q(y). Formally, we can achieve this by 
minimization of the following functional 

F[p(x),p(y\x)] = I(X- Y) + D[p(y) \\q{y)} + D[p(x) | \q{x)} (B.6) 



20 If desired, the actual solution can be obtained using the Blahut-Arimoto alternating minimization algorithm, 
see [B] 

21 A similar observation was also made in [9] in the context of a related Minimum Information principle for 
machine learning. 

22 Explicit dependence on Z is not important for the sake of discussion in this section, and thus will be omitted 
below. 
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where D[p||g] is the KL distance between two distributions p and q. Indeed, as all terms here 
are non-negative, the whole expression is minimized when each term vanishes, which happens 
exactly for the case p(x) = q(x), p(y) = q(y) and p(x,y) = p(x)p(y) = q(x)q(y). We can now 



combine the first two terms in (B.6) to get a more explicit expression 



/p(y\x\ I p(xj 
dxdy p{x)p{y\x)\og + J dxp{x) log = D\p(x, y)\\q{x)q{y)\ 

^ ^ (B.7) 

In other words, we ended up with the objective function which is nothing but the KL distance 
between the unknown distribution p{x) and a factorized "prior" distribution q(x,y) = q(x)q(y). 
This provides a justification to the method used in the main text, where we mimimize KL dis- 
tances between conditional joint loss distributions of sub-portfolios and prior joint distributions, 
conditional on pricing constraints that reference portfolios as a whole. 
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